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Intelligent Spam Detection System Using An 
Updateable Neural Analysis Engine 



Field of the Invention 

The present invention relates to SPAM detection methods, and more particularly 
to intelligently detecting and removing SPAM. 

Background of the Invention 

The rapid increase in the number of users of electronic mail and the low cost 
of distributing electronic messages, for example, via the Internet and other 
communications networks has made mass marketing via electronic mail ("e-mail") 
an attractive advertising medium. Consequently, e-mail is now frequently used as the 
medium for widespread marketing broadcasts of unsolicited messages to e-mail 
addresses, commonly known as "SPAM." 

Electronic mass marketers (also called "spammers") use a variety of 
techniques for obtaining e-mail address lists. For example, marketers obtain e-mail 
addresses from postings on various Internet sites such as news group sites, chat room 
sites, or directory services sites, message board sites, mailing lists, and by identifying 
"mailto" address links provided on web pages. Using these and other similar 
methods, electronic mass marketers may effectively obtain large numbers of mailing 
addresses, which become targets for their advertisements and other unsolicited 
messages. 
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Users of Internet services and electronic mail, however, are not eager to have 
their e-mail boxes filled with unsolicited e-mails. This is an increasing problem for 
Internet service providers (ISPs) such as America Online (AOL®) or Microsoft 
Network (MSN®) and other entities with easily identifiable e-mail addresses such as 
5 large corporations (e.g., IBM®, Microsoft®, General Motors®, etc.). ISPs object to 
junk mail because it reduces their users 1 satisfaction of their services. Corporations 
want to eliminate junk mail because it reduces worker productivity. 

To date, the prior art has been devoid of mechanisms that can block SPAM 
10 effectively. Traditionally, SPAM detection has been based around specific rules for 
detecting it. Such rules include searching for key phrases in the subject headers, 
determining whether the recipient is actually on the list of users to receive the e- 
mail, etc. 

15 More particularly, prior art systems rely on a set rule-base or on blocking 

based on simple mail fields such as sender, subject, mail body, etc. As Spammers 
become more creative in their mailings, it is increasingly difficult to block unwanted 
messages based on fixed information. Text search mechanisms in particular often 
generate a number of misses or "falses" due to the limitations of the searching 

20 mechanisms. 

In particular, such text search mechanisms traditionally utilize static logic for 
locating particular known strings. This is often insufficient due to the dynamic 
manner in which spamming methods change over time. Thus, what is needed is a 
25 process for dynamically and intelligently detecting unwanted SPAM electronic mail 
messages. 
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Disclosure of the Invention 

A system, method and computer program product are provided for detecting 
an unwanted message. First, an electronic mail message is received. Text in the 
5 electronic mail message is decomposed. Statistics associated with the text are 
gathered using a statistical analyzer. A neural network engine coupled to the 
statistical analyzer is taught to recognize unwanted messages based on statistical 
indicators. The statistical indicators are analyzed utilizing the neural network engine 
for determining whether the electronic mail message is an unwanted message. 

10 

As mentioned above, the neural network engine can be taught to recognize 
unwanted messages. In one process of teaching the neural network, examples are 
provided to the neural network engine. The examples are of wanted messages and 
unwanted messages. Each of the examples is associated with a desired output. Each 
15 of the examples is processed with statistics by the neural network engine for 

generating weights for the statistics. Each of the weights is used to denote wanted 
and unwanted messages. Preferably, the neural network engine utilizes adaptive 
linear combination for adjusting the weights. Logic associated with the neural 
network engine is updated based on the processing by the neural network engine. 

20 

In another process for teaching the neural network engine, the neural network 
engine is updated to recognize an unwanted message. The message is identified as 
an unwanted message. The features of the message that make the message unwanted 
are identified, wherein the identified features are stored and used by the neural 
25 network to identify subsequent unwanted messages. Preferably, a graphical user 
interface is provided for allowing a user to identify the features of the message that 
make the message unwanted. 
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In another aspect of the present embodiment, the neural network engine uses 
artificial intelligence that analyzes previous user input for determining whether the 
message is unwanted. 



5 A system, method and computer program product are also provided for 

teaching a neural network engine to recognize an unwanted message. Examples are 
provided to a neural network engine. The examples are of wanted messages and 
unwanted messages. Each of the examples is associated with a desired output. Each 
of the examples is processed with statistics for generating weights for the statistics. 
10 Each of the weights is used to denote wanted and unwanted messages. Logic 

associated with the neural network engine is updated based on the processing by the 
neural network engine. 



In one aspect of the present embodiment, the neural network engine utilizes 
1 5 adaptive linear combination for adjusting the weights. In another aspect of the 

present embodiment, the neural network engine is updated to recognize an unwanted 
message. The message is identified as an unwanted message. The features of the 
message that make the message unwanted are identified. The identified features are 
stored and used by the neural network to identify subsequent unwanted messages. 
20 Preferably, a graphical user interface is provided for allowing a user to identify the 
features of the message that make the message unwanted. 



WO 03/010680 



5 



PCT/US02/23811 



Brief Description of the Drawings 

Figure 1 illustrates a network architecture in accordance with the one 
5 embodiment. 

Figure 2 shows a representative hardware environment associated with the 
computers of Figure 1. 

10 Figure 3 is a flow diagram of a process for detecting an unwanted message. 

Figure 4 depicts an illustrative architecture according to an embodiment. 

Figure 5 is a flowchart of a process for teaching a neural network engine to 
15 recognize an unwanted message. 

Figure 6 is a flow diagram depicting processing performed by the neural 
network engine. 

20 Figure 7 illustrates a system for allowing a user to teach the neural network 

engine to recognize unwanted messages. 

Figure 8 illustrates a user interface used in conjunction with the system of 
figure 7. 



25 
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Description of the Preferred Embodiments 

Figure 1 illustrates a network architecture 100, in accordance with the one 
5 embodiment. As shown, computers 102 of remote users are connected to a network 
104. The remote users send electronic mail messages (e-mail) to local users, who 
receive them on computers 106. In the context of the present network architecture, 
the network may take any form including, but not limited to a local area network 
(LAN), a wide area network (WAN) such as the Internet, etc. The computers can 
10 include a desktop computer, laptop computer, hand-held computer, etc. 

The e-mail passes through a gateway 108 which analyzes the messages to 
determine whether they are SPAM prior to allowing the message to pass to the local 
users. In one embodiment, the gateway 108 may include an Internet gateway, 
15 intranet gateway, Internet proxy, intranet proxy, or any other type of intermediate 

device. The gateway includes a statistical analyzer 110 and a neural network engine 
112. 

In use, the statistical analyzer decomposes the electronic messages to 
20 determine an amount of various SPAM indicators (i.e. capitalization, punctuation, 
URLs, phone numbers, etc.). Instead of using static rule-based logic to accept or 
reject the electronic messages based on the results of the statistical analyzer, the 
results of the parsing are passed to the neural network engine. The neural network 
engine is first "taught" the difference between SPAM and non-SPAM electronic 
25 messages based on the aforementioned SPAM indicators. This is done by 

processing hundreds of electronic messages that are known to be SPAM or known to 
be non-SPAM. Once taught, the neural network engine can be used in combination 
with the statistical analyzer to accept or deny electronic messages. An administrator 
114 in communication with the gateway can be allowed to manipulate operation of 
30 the gateway and its components. 
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Figure 2 shows a representative hardware environment that may be 
associated with the remote source 102 and/or target 106 of Figure 1, in accordance 
with one embodiment. Such figure illustrates a typical hardware configuration of a 
workstation in accordance with a preferred embodiment having a central processing 
5 unit 210, such as a microprocessor, and a number of other units interconnected via a 
system bus 212. 

The workstation shown in Figure 2 includes a Random Access Memory 
(RAM) 214, Read Only Memory (ROM) 216, an I/O adapter 218 for connecting 

10 peripheral devices such as disk storage units 220 to the bus 212, a user interface 
adapter 222 for connecting a keyboard 224, a mouse 226, a speaker 228, a 
microphone 232, and/or other user interface devices such as a touch screen (not 
shown) to the bus 212, communication adapter 234 for connecting the workstation to 
a communication network 235 (e.g., a data processing network) and a display 

15 adapter 236 for connecting the bus 212 to a display device 238. 

The workstation may have resident thereon an operating system such as the 
Microsoft Windows NT or Windows/95 Operating System (OS), the IBM OS/2 
operating system, the MAC OS, Linux or other UNIX operating system. It will be 
20 appreciated that a preferred embodiment may also be implemented on platforms and 
operating systems other than those mentioned. A preferred embodiment may be 
written using JAVA, C, and/or C++ language, or other programming languages, 
along with an object oriented programming methodology. Object oriented 
programming (OOP) has become increasingly used to develop complex applications. 

25 

Spam Detection 



30 



Figure 3 is a flow diagram of a process 300 for detecting an unwanted 
message. In operation 302, an electronic mail message is received. Text in the 
electronic mail message is decomposed in operation 304. Statistics associated with 
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the text are gathered in operation 306 using a statistical analyzer. This may be 
accomplished utilizing various methods such as analyzing character type, i.e. 
Unicode, etc. In operation 308, a neural network engine coupled to the statistical 
analyzer is taught to recognize unwanted messages based on statistical indicators. 
5 The statistics are analyzed in operation 310 utilizing the neural network engine for 
determining whether the electronic mail message is an unwanted message. 

According to another embodiment, a method is provided by which it is 
possible to accurately detect SPAM by applying statistical calculations against the 

1 0 text content of the message and supply the results of the application of the 

calculations and the message text contents itself into a neural network engine. The 
neural network then attempts to determine whether the message is SPAM or not 
based on what the neural network has learned in the past and/or by comparison with 
an existing set of known SPAM. An additional mechanism can be provided so that a 

1 5 user can return a message to the SPAM engine and mark it as SPAM (or not SPAM) 
to provide the engine with an on-going learning capability. 

Architecture 

20 Figure 4 depicts an illustrative gateway architecture 400 according to an 

embodiment. The e-mail 402 arrives from the Internet 404 at the word decomposer 
406, which breaks the text content into words and punctuation. The parsed text is 
then supplied to the statistical analyzer 408 which creates a table of variables 
relating to the message, e.g.: Total Number of words, Number of words capitalized, 

25 Punctuation to word ratio etc. See Table 1 and related discussion, below. 

This table along with the decomposed word list is supplied to the neural 
network 410 which provides a weighting, or probability, that the message is SPAM, 
partially based on known patterns of SPAM messages stored in a pattern database 
30 412. If the message is determined to be SPAM, it is quarantined in a quarantine 
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database 414. If the message is determined not to be SPAM, the message is sent to a 
mail forwarder 416, which forwards the message to a user 418. 

Statistical Word Analyzer 

The Statistical Word Analyzer attempts to build some key facts about the text 
content. The facts are based on certain characteristics that users/administrators have 
determined could represent SPAM. A results table from analysis of the message is 
then built. 

At that point, the program segment shown in Example 1 may be used. 



Example 1 

15 If words__capitalized >5% and total_ punc>2 0% 

And perc_plink >50% And URL analysis fails 
And E-mail address analysis fails Then 
This is spam 

Else 

20 This is not Spam 

End If 

However, questions may arise as to whether the above analysis is accurate for 
all SPAM, whether the percentages are right, or whether the test variables are the 
25 correct ones to use. 

Because this task is difficult to do using a fixed algorithm, the statistical 
analysis ends at this point and this data is passed to the Neural Network engine to 
determine patterns in statistics and words, and use these to determine whether the 
30 message is SPAM based on comparing the patterns to patterns predetermined to be 
SPAM or non-SPAM. The greater the number of variables in the statistics table, the 
easier it is for the Artificial Intelligence engine (AI) to "learn" to differentiate 
between SPAM and genuine messages. 
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The AI solution provides two goals. In particular, the AI is used to produce a 
set of rules that can be used in an existing AI engine. Further, the AI engine is used 
as a standalone gateway for determining which messages are SPAM. 

Table 1 illustrates various steps and/or functions to achieving these goals. 

Table 1 

Pre -train the system with known SPAM from an archive of 
known SPAM, such as http://www.annexia.org/spam/ "The Great 
Spam Archive" . 

User Interface (UI) modification of the AI . A point and 
click UI where an existing e-mail is selected and displayed 
to be made an example of. Rules are constructed from the 
areas of the example mail that classify it as SPAM . See the 
section entitled Feature Editor Applet, below for a more 
detailed description. 

Application of a tiered approach, in which the user sends 
the message to an administrative area. An administrator 
utilizes the UI to notify the engine. 

Artificial Intelligence introduced to make decisions based 
on previous Administrative input. This may include 
statistical or pattern-matching intelligence and would 
automatically update the rule-base. 

Artificial Intelligence taken a level further, where 
grammatical and language decisions are made based on 
previous human- input to provide automatic generation of a 
rule-base . 

System opened up to internal users with encryption method 
for trusted SPAM identification by the users. 
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Neural Network Engine 

The statistics table is passed as inputs to the Artificial Neural Network 
(ANN). The preferred ANN is a Supervised Learning type, though other types can 
5 be used. In this type, a "teacher" (user, administrator, or computer application) 
shows examples of inputs that the engine will receive paired with desired outputs. 
An error value is produced between the desired and actual responses, which should 
reduce as the learning progresses. 

1 0 Figure 5 is a flowchart of a process 500 for teaching a neural network engine 

to recognize an unwanted message. In operation 502, examples are provided to a 
neural network engine. The examples are of wanted messages and unwanted 
messages. Each of the examples is associated with a desired output. In operation 
504, each of the examples is processed with statistics for generating weights for the 

1 5 statistics. Each of the weights is used to denote wanted and unwanted messages. 
Logic associated with the neural network engine is updated in operation 506 based 
on the processing by the neural network engine. 

In the SPAM context, there are two sets of inputs: First, an archive 
20 containing only SPAM is inputted, and secondly an archive containing only genuine 
(non-SPAM) messages is inputted. Known SPAM can be obtained from various 
online resources (http://www.annexia.org/spam/ "The Great Spam Archive"). The 
teacher may automatically and randomly pick messages from either archive and 
supply them (with the statistical table) to the ANN together with a value for SPAM 
25 or non-SPAM. This continues until the archives are exhausted. A very large data 
set possibly consisting of over 500,000 messages is preferred. 

Each input message is expressed as a vector, each variable from the statistic 
table being a feature variable in the vector. Note Table 2. 

30 



Table 2 
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Capitals 
X= Punctuation 
Dollars 

5 

Figure 6 is a flow diagram depicting processing performed by the ANN. The 
most appropriate form of ANN would be an Adaptive Linear Combiner (ALC) 600, 
which allows the presentation of input vectors X 602 and desired responses d 604 to 
10 the ALC. This will adjust weights until outputs a 606 are close to the desired 
responses. 

After the learning process has taken place, the Engine can be deployed into 
the gateway situation. All associated vectors, matrices and weights to be used with 
1 5 the ALC can be stored permanently on disk. 

The gateway could also be used to monitor intercommunication between 
Internet servers for tighter company-wide security, such as for preventing outbreak, 
SPAM, hacking attempts, etc. Such functionality can extend to the whole Internet 
20 community. 

In addition to the pre-learning, there can also be an interactive learning 
mechanism while the gateway is active. This is discussed in the next section. 

25 User Teaching Mechanism 

Some users of the system will be given permissions to allow teaching the 
Spam Engine when new mail arrives. Figure 7 illustrates the system that allows 
such teaching. 

30 

A typical sequence of actions using the system would be as follows. E-mail 702 
is received by a user teacher 704, but the e-mail has been marked incorrectly as 
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clean. The user returns the e-mail 706 back to the Gateway 708 but indicates the 
desired response as "Spam." A learner 710 in the Gateway receives the e-mail and 
desired response. The Gateway stores the e-mail creating a unique ID for it. The 
Gateway sends a new e-mail 712 to the user. The new e-mail contains a unique ID 
5 contained within a URL to the Feature Editor (Java Applet) 714. The user clicks on 
URL and indicates which features of the text of the e-mail make it SPAM. 
Preferably, the user is allowed to select the parts of the mail that clearly define it to 
be SPAM/offensive (e.g. subject, body, sender, attachments etc) and then within 
each part refine it down to the data elements to search on for CF filtering (set of 
10 words, file name / type for attachment, sender domain etc). The results of the user 
input are passed into a feature queue 716. 

The new features are not applied directly to the Statistical Analyzer/AI Engine 
718 in one embodiment because the ANN would have to be reset and re-learn its 

15 mail archive 720. Instead, a batch teacher 722 schedules an entire learning run at a 
scheduled interval with additional features from the feature queue. A secondary 
ANN 724 is used so that the primary ANN can stay active and continue to process e- 
mail while the secondary ANN is learning. When the learning process is finished, 
the primary ANN becomes the secondary ANN and vice versa. The AI Engine is 

20 then initialized with the new features. 



The AI engine could optionally be configured to divide SPAM into different 
confidence levels, e.g. Definitely SPAM, definitely not SPAM and possibly SPAM. 
The possibly SPAM would still be forwarded to the recipient, but also copied to a 
25 user teacher for feeding manual categorization in future training. 



Feature Editor Applet 



30 



Figure 8 illustrates the user interface 800 for applying features to the text of 
an e-mail. The user is able to highlight areas of text and select (right click) from a 
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pop up menu 802 a measurement to apply to this text. Such measurements are 
shown in Table 3. 

Table 3 

Contains a single instance of word - true/false 
Contains multiple instances of word = true/false 
Count number of occurrences of word = Number 
Ratio of word to total number of words = Percentage 

Selection of multiple words is also possible. Additional measurements will then 
be applied. Note Table 4. 

Table 4 

All words must exist = true/false 
Some words can exist = true/false 

Multiple instances of all words must exist = true/false 
Multiple instances of some words must exist = true/false 
Ratio of all words to total number of words = Percentage 
Ratio of some words to total number of words = 
Percentage 

When the OK button 804 is selected, these additional measurements are added to 
the feature queue for future inclusion in a batch teaching session. It is also possible 
for an administrator to review and edit the additional features before a batch run 
takes place. 

While various embodiments have been described above, it should be 
understood that they have been presented by way of example only, and not 
limitation. Thus, the breadth and scope of a preferred embodiment should not be 
limited by any of the above-described exemplary embodiments, but should be 
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defined only in accordance with the following claims and their equivalents. 
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Claims 

What is claimed is: 

1 . A method for detecting an unwanted message, comprising: 

(a) receiving an electronic mail message; 

(b) decomposing text in the electronic mail message; 

(c) gathering statistics associated with the text using a statistical analyzer; 

(d) teaching a neural network engine coupled to the statistical analyzer to recognize 
unwanted messages based on statistical indicators; and 

(e) analyzing the statistical indicators utilizing the neural network engine for 
determining whether the electronic mail message is an unwanted message. 

2. The method as recited in claim 1 , wherein the neural network engine is taught to 
recognize unwanted messages by providing examples to the neural network 
engine, wherein the examples are of wanted messages and unwanted messages, 
wherein each of the examples is associated with a desired output. 

3. The method as recited in claim 2, wherein each of the examples are processed 
with statistics by the neural network engine for generating weights for the 
statistics, wherein each of the weights is used to denote wanted and unwanted 
messages. 

4. The method as recited in claim 3, wherein the neural network engine utilizes 
adaptive linear combination for adjusting the weights. 

5. The method as recited in claim 3, wherein logic associated with the neural 
network engine is updated based on the processing by the neural network engine. 
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6. The method as recited in claim 1 , wherein the teaching of the neural network 
engine includes identifying a message as an unwanted message, the features of 
the message that make the message unwanted are identified, and the identified 
features are stored and used by the neural network to identify subsequent 
unwanted messages. 

7. The method as recited in claim 6, wherein a graphical user interface is provided 
for allowing a user to identify the features of the message that make the message 
unwanted. 

8. The method as recited in claim 1 , wherein the neural network engine analyzes 
previous user input for determining whether the message is unwanted. 

9. The method as recited in claim 1, wherein the statistics gathered using the 
statistical analyzer include results of an analysis of character type. 

10. A computer program product for detecting an unwanted message, comprising: 

(a) computer code for receiving an electronic mail message; 

(b) computer code for decomposing text in the electronic mail message; 

(c) computer code for gathering statistics associated with the text using a statistical 
analyzer; 

(d) computer code for teaching a neural network engine coupled to the statistical 
analyzer to recognize unwanted messages based on statistical indicators; and 

(e) computer code for analyzing the statistical indicators utilizing the neural 
network engine for determining whether the electronic mail message is an 
unwanted message. 

11. A system for detecting an unwanted message, comprising: 
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(a) a statistical analyzer for gathering statistics associated with text retrieved from 
an electronic mail message; and 

(b) a neural network engine coupled to the statistical analyzer, the neural network 
engine capable of being taught to recognize unwanted messages based on 
statistical indicators; 

(c) wherein the neural network engine analyzes the statistical indicators for 
determining whether the electronic mail message is an unwanted message. 

12. A method for teaching a neural network engine to recognize an unwanted 
message, comprising: 

(a) providing examples to a neural network engine, wherein the examples are of 
wanted messages and unwanted messages, and each of the examples is 
associated with a desired output; 

(b) processing each of the examples with statistics for generating weights for the 
statistics, wherein each of the weights is used to denote wanted and unwanted 
messages; and 

(c) updating logic associated with the neural network engine based on the 
processing by the neural network engine. 

13. The method as recited in claim 12, wherein the neural network engine utilizes 
adaptive linear combination for adjusting the weights. 

14. The method as recited in claim 12, wherein the neural network engine is updated 
to recognize an unwanted message, the message is identified as an unwanted 
message, the features of the message that make the message unwanted are 
identified, and the identified features are stored and used by the neural network 
to identify subsequent unwanted messages. 
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15. The method as recited in claim 14, wherein a graphical user interface is provided 
for allowing a user to identify the features of the message that make the message 
unwanted. 

1 6. A computer program product for teaching a neural network engine to recognize 
an unwanted message, comprising: 

(a) computer code for providing examples to a neural network engine, wherein the 
examples are of wanted messages and unwanted messages, and each of the 
examples is associated with a desired output; 

(b) computer code for processing each of the examples with statistics for generating 
weights for the statistics, wherein each of the weights is used to denote wanted 
and unwanted messages; and 

(c) computer code for updating logic associated with the neural network engine 
based on the processing by the neural network engine. 

17. A system for teaching a neural network engine to recognize an unwanted 
message, comprising: 

(a) a neural network engine adapted to receive examples of wanted messages and 
unwanted messages, each of the examples being associated with a desired 
output, the neural network engine further adapted for processing each of the 
examples with statistics for generating weights for the statistics, each of the 
weights being used to denote wanted and unwanted messages; and 

(b) wherein the neural network engine is capable of being updated. 

18. A method for detecting an unwanted electronic mail message, comprising: 
(a) receiving user input via a graphical user interface, the user input including 

indicators of unwanted electronic mail messages, wherein the indicator for a 
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particular unwanted message includes at least one of a URL, a sender, and 
domain of a sender; 

(b) storing the user input indicators in a database; 

(c) receiving an electronic mail message prior to delivery of the electronic mail 
message to an intended recipient of the electronic mail message; 

(d) analyzing the electronic mail message for determining whether the electronic 
mail message is an unwanted message based on the user input indicators stored 
in the database; 

(e) sending the electronic mail message to the intended recipient if the electronic 
mail message is not determined to be unwanted; and 

(f) selecting a disposition of the electronic mail message if the electronic mail 
message is determined to be unwanted, the disposition including at least one of: 
not delivering the electronic mail message to the intended recipient, sending the 
electronic mail message to an administrator, and sending the electronic mail 
message to a quarantine. 

19. The method as recited in claim 1 8, and further comprising decomposing text in 
the electronic mail message, gathering statistics associated with the text using a 
statistical analyzer, and analyzing the statistical indicators utilizing a neural 
network engine for determining whether the electronic mail message is an 
unwanted message. 

20. A method for detecting an unwanted message, comprising: 

(a) receiving user input via a graphical user interface, the user input including 
indicators of unwanted electronic mail messages; 

(b) teaching a neural network engine to recognize unwanted messages based on the 
user input; 

(c) receiving an electronic mail message; and 
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(d) analyzing the electronic mail message utilizing the neural network engine for 
determining whether the electronic mail message is an unwanted message based 
on the user input; 

(e) sending the electronic mail message to the intended recipient if the electronic 
mail message is not determined to be unwanted; and 

(f) selecting a disposition of the electronic mail message if the electronic mail 
message is determined to be unwanted, the disposition including at least one of: 
not delivering the electronic mail message to the intended recipient, sending the 
electronic mail message to an administrator, and sending the electronic mail 
message to a quarantine. 

21. The method as recited in claim 20, and further comprising decomposing text in 
the electronic mail messages, gathering statistics associated with the text using a 
statistical analyzer, and analyzing the statistical indicators utilizing the neural 
network engine for determining whether the electronic mail message is an 
unwanted message. 
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Fig. 2 
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300 



RECEIVE AN ELECTRONIC MAIL MESSAGE 



302 



DECOMPOSE TEXT IN THE ELECTRONIC MAIL MESSAGE 



304 



GATHER STATISTICS ASSOCIATED WITH THE TEXT USING A 
STATISTICAL ANALYZER 



306 



TEACH A NEURAL NETWORK ENGINE COUPLED TO THE 
STATISTICAL ANALYZER TO RECOGNIZE UNWANTED MESSAGES 
BASED ON STATISTICAL INDICATORS 



308 



ANALYZE THE STATISTICS AND STATISTICAL INDICATORS 
UTILIZING THE NEURAL NETWORK ENGINE FOR DETERMINING 
WHETHER THE ELECTRONIC MAIL MESSAGE IS AN UNWANTED 

MESSAGE 
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Fig. 3 
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v. 



PROVIDE EXAMPLES TO A NEURAL NETWORK ENGINE, WHERE 
THE EXAMPLES ARE OF WANTED MESSAGES AND UNWANTED 
MESSAGES AND EACH OF THE EXAMPLES IS ASSOCIATED WITH A 

DESIRED OUTPUT 



PROCESS EACH OF THE EXAMPLES WITH STATISTICS FOR 
GENERATING WEIGHTS FOR THE STATISTICS, WHERE EACH OF 
THE WEIGHTS IS USED TO DENOTE WANTED AND UNWANTED 

MESSAGES 



UPDATE LOGIC ASSOCIATED WITH THE NEURAL NETWORK 
ENGINE BASED ON THE PROCESSING BY THE NEURAL NETWORK 

ENGINE 
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Fig. 5 
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desired signal + noise 
d(k)-s(k)+n(k) 



604 
602 



600 



error e(k) * d(k) - a(k) - $(k) 



606 




correlated with n(k) 



After convergence, e(k)= d(k) - a(k) = s(k) + n(k) - n(k) = s(k) 



Fig. 6 
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802 



Feature Applet 



There are other ways to make money! 

This product produces 50% of alt the money made on the Internet! 
Now for the first time, it is brought to you retail! People like you are making 
$600-$4,000 

per week in CASH with this product! 
No selling! Not MLM! All CAS|^!! 

Call Toll Free 1-888-804-6837 

Only a few people per area will] 
product! 

So act fast and be the first one 
it time you earn 

what you are worth? Aren't you tired of making someone else rich? Well, here is 

your chance to 

make YOU RICH!!! 

No selling! Not MLM! All CASH!! 




> this revolutionary 
jje the most money!!! Isn't 
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